Flexible clustering via hidden hierarchical Dirichlet priors
نویسندگان
چکیده
The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular nonparametric model clustering probability distributions is nested Dirichlet process, which however has drawback of grouping in a single cluster when ties are observed samples. With goal achieving flexible and effective method both observations, we investigate prior that arises as composition two discrete random structures derive closed-form expression induced distribution partition, fundamental tool regulating behavior model. On one hand, this allows gain deeper insight into theoretical properties and, on other it yields an MCMC algorithm evaluating inferences interest. Moreover, limitations working more than populations consequently, devise alternative efficient sampling scheme, by-product, testing homogeneity between populations. Finally, perform comparison process provide illustrative examples synthetic real data.
منابع مشابه
Infinite Hidden Markov Models via the Hierarchical Dirichlet Process
Category: graphical models. In this presentation, we propose a new formalism under which we study the infinite hidden Markov model (iHMM) of Beal et al. [2]. The iHMM is a hidden Markov model (HMM) in which the number of hidden states is allowed to be countably infinite. This is achieved using the formalism of the Dirichlet process. In particular, a two-level urn model is used to determine the ...
متن کاملHierarchical Dirichlet Process Hidden Semi-Markov Models
Given a set of sequential data in an unsupervised setting, we often aim to infer meaningful states present in the data along with characteristics that describe and distinguish those states. For example, in a speaker diarization (or who-spoke-when) problem, we are given a single audio recording of a meeting and wish to infer the number of speakers present, when they speak, and some characteristi...
متن کاملFlexible Priors for Exemplar-based Clustering
Exemplar-based clustering methods have been shown to produce state-of-the-art results on a number of synthetic and real-world clustering problems. They are appealing because they offer computational benefits over latent-mean models and can handle arbitrary pairwise similarity measures between data points. However, when trying to recover underlying structure in clustering problems, tailored simi...
متن کاملSpike train entropy-rate estimation using hierarchical Dirichlet process priors
Entropy rate quantifies the amount of disorder in a stochastic process. For spiking neurons, the entropy rate places an upper bound on the rate at which the spike train can convey stimulus information, and a large literature has focused on the problem of estimating entropy rate from spike train data. Here we present Bayes least squares and empirical Bayesian entropy rate estimators for binary s...
متن کاملDetecting Abnormal Events via Hierarchical Dirichlet Processes
Detecting abnormal event from video sequences is an important problem in computer vision and pattern recognition and a large number of algorithms have been devised to tackle this problem. Previous state-based approaches all suffer from the problem of deciding the appropriate number of states and it is often difficult to do so except using a trial-and-error approach, which may be infeasible in r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Scandinavian Journal of Statistics
سال: 2022
ISSN: ['0303-6898', '1467-9469']
DOI: https://doi.org/10.1111/sjos.12578